THE DISTRIBUTION OF SPACINGS BETWEEN 
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Abstract. We study the distribution of spacings between squares 
modulo q, where q is square-free and highly composite, in the limit 
as the number of prime factors of q goes to infinity. We show that 
all correlation functions are Poissonian, which among other things, 
implies that the spacings between nearest neighbors, normalized to 
have unit mean, have an exponential distribution. 
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1. Introduction 



Our goal in this paper is to study the distribution of spacings (or 
gaps) between squares in Z/gZ, as q — > oo. In the case that q is prime, 



a theorem of Davenport P, f|, O, [L8j shows that the probability of two 



consecutive quadratic residues modulo a prime q being spaced h units 
apart is 2~ h , as q — > oo. For our purposes, we may interpret this result 
as saying that when we normalize the spacings to have unit mean, then 
the distribution of spacing as q —>■ oo along primes is given by 

oo , 

P(a) = jV\y(a - -) 

h=l 

that is, a sum of point masses at half-integers with exponentially de- 
creasing weights. 

In this paper we study the spacing distribution of squares modulo q 
when q is square-free and highly composite, that is the limiting distribu- 
tion of spacings between the squares modulo q as the number of prime 
divisors, u(q), tends to infinity. For odd square-free q the number N q 
of squares modulo q equals 

p\q 

This is because if p is an odd prime, the number of squares modulo p 
is (p + l)/2 and for q square- free, x is a square modulo q if and only if 
x is a square modulo p for all primes p dividing q. Thus for odd q, the 
mean spacing s q = q/N equals 

2^(<?) 



s 



i 



For q = 2q' even and square-free, it is easily seen that s q = s q r. It 
follows that s q — > oo as uj{q) — >• oo, unlike the case of prime q where 
the mean spacing is essentially constant. Thus, unlike in the prime 
case where the level spacing distribution was forced to be supported 
on a lattice, in the highly composite case there is an a-priori chance of 
getting a continuous distribution. 

A relevant statistical model for the distribution of spacings is given 
by looking at random points in the unit interval R/Z. For independent, 
uniformly distributed numbers in R/Z, the spacing statistics are said to 
be Poissonian. The distribution P(s) of spacings between consecutive 
points will be that of a Poisson arrival process, i.e. P{s) = e~ s (see 
0). Moreover, the joint distribution of k consecutive spacings is the 
product of k independent exponential random variables. 
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It is well known [jHJ that the spacing statistics of the superposition 



of several independent spectra converges to the Poisson case - the spac- 
ings statistics of uncorrelated levels. Thus the heuristic that "primes are 
independent" together with Davenport's result indicates that the spac- 
ing statistics of the squares modulo q should in the limit as u(q) — » oo 
be Poissonian, i.e., that in some sense squares modulo q behaves as 
random numbers. It is our purpose to confirm this expectation. 

In order to study the level spacings, we proceed by studying the 
r- level correlation functions. These measure clustering properties of 
a sequence in R/Z on a scale of the mean spacing. Their definition 
and their application to computing various local spacings statistics are 
recalled in appendix [A[ In our case, these turn out to be given by by 
the following: For r > 2 and a bounded convex set C C R r_1 , let 

Rr(C, q) = -jrrih{ x i distinct squares mod q : (xi—Xz, . . . x r -i—x r ) G sC}. 

This is immediately transformed into 

(1.1) R r {C,q) = ^ N M 

where N(h, q) is the number of solutions of the system of congru- 
ences y i+ i — Hi = hiYno&q with y x , y 2 , . . . y r squares modulo q and 
h = (h±, . . . h r -i) G Z r_1 . 

To compute the correlations for distinct Xi we consider only sets C 
which a-priori only contain vectors (sc— Xt+i) with distinct coordinates. 
To do this, we define "roots" on R r_1 for i < j by o~ij(h) = Ylk=i ^k- 
The hyper-planes {a^ = 0} C R r_1 are called "walls", and (xj — x i+ i) 
does not lie in any of the walls if and only if all coordinates X{ are 
distinct. 

Our main result shows that if C does not intersect any wall then 
R r (C, q) — > vol(C) for any sequence of square-free q with u(q) — ► oo: 

Theorem 1. Let q be square-free, r > 2 and C C R r_1 a bounded 
convex set which does not intersect any of the walls. Then the r- level 
correlation function satisfies 

R r (C, q) = vol(C) + 0(s- 1/2+e ) as s -> oo 

for all e > 0, where s is the mean spacing. 

This theorem implies that all spacing statistics are Poissonian (see 
Appendix ^). For instance, if we denote by s\, . . . , sn-i the normalized 
differences between neighboring squares, then we have 
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Theorem 2. For q square-free, the limiting level spacing distribution 
of the squares modulo q is given by P(t) = exp(— t) as u(q) — > oo. 
Moreover, under the same condition, for any k > 1 the limiting joint 
distribution of (s n , s n+ \, . . . , s n+ k) is a product Yli=o ex P( - ^) of k + 1 
independent exponential variables. 

There are only a few known cases where the complete spacing dis- 
tribution can be proved to be Poissonian as in our case. A notable 
example is Hooley's results |7|, ||, ||, [UJ that the spacings between ele- 



ments co-prime to q are Poissonian as the mean spacing q/<p(q) — > oo. 
A much more recent result is due to Cobeli and Zaharescu who 
show that the spacings between primitive roots modulo a prime p are 
Poissonian provided the mean spacing pj ' <fi(p — 1) — ► oo. 

The results of this paper are related to work on the level spacing 
distribution of the fractional parts {an 2 } (a irrational) by Rudnick, 
Sarnak and Zaharescu I7|. In particular, in |T7J an attempt to 



study that problem is made by replacing a with a rational approxi- 
mation b/q, and this leads to study the spacings of the sequence bn 2 
mod q, 1 < n < N for iV a small power of q. The available sites 
are exactly the set of squares modulo q, and hence our interest in the 
problem. 

In fl7 |) it is shown that in order for all the correlation functions 



of the sequence {an 2 } to have Poisson behavior, it is necessary to 
assume that the rational approximants b/q have denominator q which 
is close to square-free. Hence our interest in the square-free case. For 
arbitrary q it is still true that all correlations are Poissonian, but there 
are significant technical complications to overcome in proving this, see 
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We believe that the methods developed in this paper should be useful 
in studying similar problems, for instance the spacing distribution of 
cubes modulo q, as the number of prime factors of q that are congruent 
to 1 modulo 3 tends to infinity. (The condition modulo 3 is necessary 
in order for the mean spacing to go to infinity.) 



Contents of the paper: We begin with a section sketching the argu- 
ment for Theorem [1] in the case of the pair correlation function. This 
section can be used as a guide to the rest of the paper. 

In section |3| we first reduce the problem to the case that q is odd. 
Then in section ^ we analyze the behavior of N(h,p) where p is prime. 
Squares that are distinct modulo q are not necessarily distinct modulo 
p; we denote by r e ff(h) the number of squares that remains distinct 
after reduction modulo p. Using an inclusion-exclusion argument we 
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write r e ff(h) as a linear combination of characteristic functions of cer- 
tain hyper-planes over Z/pZ. Next, in section |5] we use the multiplica- 
tive properties of the counting functions N(h, q) to derive an expression 
for R r (C,q) as a sum over divisors c of q and lattices L arising from 
intersections of hyper-planes modulo p for different p's (proposition ^j. 

In section |6] we show that the main term of the sum consists of those 
terms for which the product of c and the discriminant of L are small 
with respect to s, and an error term corresponding to terms where the 
product is large. In section [7|we evaluate the main term and show that 
it gives us exactly vol(C) , thus giving us our main result. 

In appendix |A| we explain how to use Theorem [I] to derive results such 
as Theorem |2|, that the level spacings are Poissonian as well. Appen- 
dix P explains some background on counting lattice points in convex 
sets. In appendix we estimate the number of divisors of q that are 
smaller than a fixed power of the mean spacing s. 
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2. The pair correlation - a sketch 

In order to explain the proof of our main theorem [I], we give an 
overview of the argument in the special case of the pair correlation 
function. 

Let q be an odd, square-free number with w(q) prime factors, and 
/ an interval, not containing the origin. Define as in the introduction 
the pair correlation function 

hesinz 

where N is the number of squares modulo q, s = q/N = 2 w ^^/ cr -i(?) 
is their mean spacing, (J-i(q) = Yl p \ q (^ + p)> an d N(h, q) is the number 
of solutions in squares modulo q of the equation 

Ux - V2 = h mod q 

We will sketch a proof that R2(I,q) — > |/| as u(q) — > oo (|/| being the 
length of the interval). In fact we have the more precise result 

Theorem 3. For q odd, square- free we have for all e > 

R 2 (I,q) = \I\+0(s- 1+ *) 

Here are the main steps in the argument: 
Step 1: 

By the Chinese Remainder Theorem, N(h,q) = Ylp\ q N(h,p) is a 
product over primes dividing q. By elementary considerations, one 
sees that 

(2.1) N(h, P ) = i±^A(h,p) 
with a(h,p) = 0(1) and 

A(h,p) = l + 6(h,p), 6(h,p) = 
From this we see that 

(2.2) w(MH i^M E 4M 

c\q 

with a(/i,c) := H p \ c a(h,p) < c € and A(h,q) = H plq A(h,p). 
Step 2: 

We decompose A(/i, g) = A(/i, c)A(h, -) and rewrite A(/i, -) as 

A(/i,|)=JJ(l + «(M) = E ,5 (' 1 ^) 
p|f all 



p Jh 



V 



h 
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with 

8(h,g) = 

Substituting this into the expression (|2.2| ) for N(h, q) and inserting the 
result into the formula for B,2{I, q), we get 

( 2 - 3 ) MI>Q) = - lf0 L (g) E^E E <h,c)A{h,c) 

1{qJ c\q g\i hesIngZ 

Step 3: 

We partition the sum into two parts, one over the pairs g, c with gc < 
s and the leftover part over pairs with gc > s. We will show this leftover 
part is negligible (in fact 0(s~ 1+e )): We first use a(h,c)A(h,c) <C c e 
and the fact that in order for the inner sum over h to be nonempty, we 
need g <C s (recall that / does not contain the origin!) to get that the 
sum over pairs with eg > s is bounded by 

c\q ff|f c|, 5 



Now we use Lemma [XSj which shows that the number of divisors g < s 
of q is a most 0(s e ) and Lemma O to bound the above by 



l+e 



as promised. 
Step 4: 

For each pair of c, g with eg < s, we first treat the inner sum over 
h E si n g7i. We break it up into sums over + 0(1) subintervals 
[y, y + eg) fl gZ plus a leftover term of size at most c 1+e . For each 
subinterval, we use periodicity of a(h, c)A(h, c) under h t— > h + c to find 

c 

£ a(/i, c)A(/i, c) = £ a^/ij, c)A{ghx, c) 

he[y,y+cg)DgZ h 1= l 

Because g is square-free, and g divides q/c, we have that g, c are 
coprime. Therefore we can change variables h = gh\ to get that this 
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last sum equals 

E a{h,c)A{h,c) = Y[ <h,p)A(h,p) 

h mod c p\c h mod p 

We evaluate the sum J2h modp°(^'P)^(^'P) by noting that sum- 
ming (|2.1| ) over h mod p, the sum of the LHS is simply the number of 
all pairs of squares modulo p, namely (p + l) 2 /4. This gives 

E a(h,p)A(h,p) = p + 1 

h mod p 

Thus the inner sum over h G si H gZ equals 



E <K c)A(h, c) = ( ^1 + 0(1)) H(p + 1) + Oic 1 ' 

<7_l(c)+0(c 1+e ) 



hGsIrigZ v 3 p|c 



Step 5: 

Inserting this into the expression ( |2.3| ) for R 2 (I,q) gives 



Now we extend the sum to all pairs g, c, to find that up to an error 
of 0(s~ 1+e ) we have 

i? 2 /, q) ~ / ^E E - 



= |,| 1 £i = |,| 

which is what we need to prove our theorem □. 

In the following sections, we will repeat these steps with full details 
for the higher correlation functions, where several technical complica- 
tions arise. 
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3. Reduction to odd q 

We first show that in Theorem [I] it suffices to consider only the case 
of q odd: Suppose that q = 2q' with q' odd and square-free. We recall 
that 

(3.1) jRr(C ,g) = I N M 

q hGsCnZ'- 1 

where N(h, q) is the number of solutions of the system y i+ i — y% = 
hi where yi, y 2 , ■ ■ ■ y r are squares modulo q and h = (hi, . . . /v-i) £ 
(Z/gZ)- 1 . 

By the Chinese Remainder Theorem, the number N q of squares mod- 
ulo q is the product 

N q = N 2 N q , = 2N q , 
Therefore the mean spacing s q := q/N q is given by 

/ x V q' 

(3 ' 2) s " = 2iVV = = s « 

Moreover, again by the Chinese Remainder Theorem, 

N(h,q) = N(h,2)N(h,q') 

and since all residues modulo 2 are squares, we have N(h, 2) = 2. Thus 
we find 

N(h,q) = 2N(hW) = N(h,q') 
1 ' ' N q 2N q , N q , 

Inserting Q into (|PD, we find that 

i2r(C,g) =Rr{C,q') 

This shows that it suffices to prove Theorem ^] for g odd, which we 
assume is the case in the sequel. 
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4. The prime case 

Let p > 2 be a prime. For h = (hi, . . . /i r _i) G (Z/pZ) r-1 , we define 
N r (h,p) to be the number of solutions in squares y^ mod p (including 
yi = 0) of the system 

(4.1) yi-y i+1 = hi mod p, 1 < i < r — 1 

This number depends crucially on the number of distinct yj. For each 
h = (hi, . . . , h r -i), we define r e ff(h) to be the number of distinct ?/,• 



(not necessarily squares) satisfying the system ( f4.1|) . Since the solu- 
tions of the homogeneous system yi — y i+ i = mod p are spanned by 
(1, . . . , 1), r e ff(h) is well-defined (independent of the particular solu- 
tion y of Q). 

We define roots &ij(h), 1 < i < j < r by 

j-i 

(4.2) ^(/i) = 

k=i 

so that (7j > j + i(/i) = hi, Oij = a k,k+i- The solutions of (|4.1|) are all 
distinct of and only if &ij(h) ^ 0, for all i < j, since 

3-1 3-1 

y* ~ yj = Vk ~ Vk+i = s ^2 hk = vvV 1 ) 

k=i k=i 

Proposition 4. Let r e ff(h) be the number of distinct y^ in a solution 



/a q\ » r /, x P + a(h,p) 
( 4 -3) N r (h,p) = — — 

wzt/i a(h,p) <C r p 1//2 : . 

Proof. The case r e ff(h) = 1 is precisely when ft, = and all yi are 
equal: yi = y% = ■ ■ • = y r . In this case the number of solutions is the 
number of squares modulo p, namely (p+ l)/2, which is of the desired 
form. We thus assume from now that r e ff(h) > 1. 

We first reduce the system ( |4.1| ) to a system of r e // — 1 equations 
in r e // variables: If r e ff(h) is the number of distinct in a solution of 
(|4.1j ) (independent of y\), then we can eliminate some of the equations. 
Renumber the variables so that yi, ■ ■ ■ yr eff are the distinct coordinates 
of a solution, and for all j > 1, y reff +j equals one of these, then the 
system (|4.1|) is equivalent to the reduced system 



(4.4) yi - y i+x = h\ mod p, 1 < % < r eff - 1 
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(where the h[ are renumbered hj to give that the first r e ff coordinates 
are distinct). So we need to find the number of solution of the reduced 
system (JO]). 

We first eliminate those solutions where at least one of the i/j is zero. 
In this case, since the system ( |4.4| ) (considered as a linear system) 
has rank r e ff — 1 in r e ff variables, specifying any one of the variables 
determines all the others, hence the number of solutions with some 
coordinate zero is at most r e //. Thus we need only count solutions 
where all coordinates yi are nonzero. 

To every such solution in squares yi 7^ mod p, write yi = xf 
mod p with ^ mod p. There are precisely two such solutions, 
namely ±£j mod p. Thus the number of possible Xj corresponding to 
a given solution y of fl4.4|) is precisely 2 re -ff , and the number of nonzero 
solutions of the reduced system ( fl4|) with yi squares modulo p is exactly 
\/2 Te ff times the number of solutions of the system 

(4.5) xl — xf +1 = h[ mod p, 1 < i i < r e // — 1 

with Xi mod p. By adding back at most r solutions we can remove 
the condition xi ^ 0, and then we find that 

(4.6) N r (h,p) = -±-n(ti,p) + O r (l) 

where n(h',p) is the number of solutions of 

%i — = b! i mod p 1 < i < r e //(/i) — 1 
This is just the number of solutions (t ) of the system 

(4.7) x\ = t-b u xl = t-b 2 ,..., x 2 Teff = t - b reff 

with 6i = 0, b 2 = h[, 6 3 = h[ + h' 2 , . . . , b T . eff{h) = h[ + h' 2 + h 

h' i and in general b^ = cr^ (/;/). Note that the bi are distinct - 
this is equivalent to the requirement that the solutions of the reduced 
system ( |4.4D be distinct. One can now use the "Riemann Hypothesis 
for curves" |T| (see Schmidt's book |19| , Chapter II, Theorem 5A and 
Corollary 5B for the case bi = —l,b 2 = —2, . . . b r = — r), to find 

(4.8) \n(ti,p) -p\ < r eff 2 r ^^/p 
In addition, \N(h,p) — n(h' ,p) /2 r ' eff \ < r and so 

*(M = p + a(M 

with 

a{h, p) < 2 r *ft (r e// Vp + r) < r y/p 
This proves Proposition ^. □ 
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4.1. A formula for r e ff(h). Our next order of business is to give a 
formula for r e ff(h). We begin with some combinatorial background: 
A set partition of the set {1,2,..., r} is a collection of disjoint subsets 
F = [F 1 ,...,F t ], Fi C {l,2,...,r}, whose union is all of {1,2, ...,r}. 
We set |F| = t, the number of subsets in F_. 

To each set partition F, we associate a subset Vp of afline r-space 
V = A r by setting 

(4.9) Vf_ = {s E A r : Si = Sj if i,j are in some 
Correspondingly, in H = A 7 " -1 we have a subspace 

(4.10) Hp = {he A r_1 : (7y(/i) = if i,j are in some F fe } 

Under the map n : V — > /J taking s = (s,) i— > (sj — Sj+i), we have 
Vf = tt- 1 H f _. 

There is a partial ordering on the collection of all set-partitions of 
{1, . . . , r} with F_ -< G if and only if every Fj is contained in some Cr,-. 

For example, O = [{1, 2, ... , r}] is the maximal element of this par- 
tial ordering, with \0\ = 1 and Ho = (0). The minimal element is 
r = [{1}, {2}, . . . , {r}] with |r| = r and fZj. = A r_1 . 

The partial ordering on set-partitions is inclusion-reversing on sub- 
spaces: F^G^VfDVg^HfDHg- 

The regular part of Vp is 



and likewise we define Hp. Then Hq = Hg_ = (0), and every h belongs 
to a unique Hp for some F_. We thus have 



= {s E Vp_: Si j£ Sj if i, j are not in some F k } 



H 



F 



and likewise 



F-<G 

We can now give a formula for r e //(/i): 
(4.11) r e// (/i) = dimV> = dim%+ 1 = |£| 

where F_ is the unique set-partition such that h G i/^. 
We can write this as follows: Define 



(4.12) 




he Hp 

otherwise ' 




otherwise 
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Then 

(4.13) r eff (h) = J2dMVF)Sl(h) 

F 

Similarly 

(4.14) A(/i,p) ■= 2 r " r «"W = Y, 2C ° dim(V - S l( h ) 

F 

It will be convenient to express this in terms of the characteristic func- 
tion 5p_ of the subspaces Hp_. For this we use Mobius inversion. Since 
the collection of all set-partitions of {l,...,r} is a partially-ordered 
set, it has a Mobius function fi(F,G) which is the unique function so 
that for any functions t/>, on set-partitions satisfying 

(4.15) </>(£) = J>(£) 

F^G 

we have 

(4.16) V(£) = X>(£,G)<KG) 

F-<G 



An explicit form of n(F,G) can be found in [l4[, §25. We will not have 
any use for it. 

In our case, clearly we have Hp = \J F ^ G Hq so that 



(4.17) *£=£*3 
Thus we have 

(4.18) *£= J>(££)*G 

This gives us the formula for A(h,p) = 2 r_r<!// ^- ) : From Q4.14|) and 
we find 

(4.19) A(M = Z^fcW 

G 

with 

— !i(/i ; ^): ,c< 

F-<G 



(4.20) X(G) = MZ,G)2 codir 



For use in Section |7], we need to know the sum of the product of 
A(h,p) with the error term a(h,p) in ( [4.3| ) over all vectors h: 
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Lemma 5. 

£ a(h, P )A(h,p) = (p + iy- P r J2m)p~ codimH ^ 

h mod p G 

Proof. We have by definition 
so that 

a(h,p)A(h,p) = 2 r N(h,p) -pA(h,p) 
Now sum over all h mod p: The sum of N(h, p) is just the total number 
of r-tuples of squares modulo p, namely ( £ rr-) r . To sum A(h,p) over 
h, we use fl4.19| ): Since the sum over all h of Sc(h) is just the number 
of vectors in the subspace Hg_, namely p dimH G = pr-i-codim*/^ we g nc [ 

a(h,p)A(h,p) = (p + l) r -p A ^'P) 

h mod p h mod p 

= ( P + ly -p r Y^ KG)p' codimHa 

G 

as required. □ 
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5. A FORMULA FOR R r (C,q) 

In order to prove Theorem [|, we give an expression ( |5.2|) for the r- 
level correlation R r (C, q) which involves summing over the intersection 
of the dilated set sC with various lattices. 

Recall that for each set-partition G of {l,...,r} we associated a 
subspace Hg_ C (Z/pZ) r_1 . Now given a divisor d mod q, let Q = 
be a tuple of such set-partitions, one for each prime p dividing 
d (recall that q, hence d, is square-free). Let L{Q) C Z r_1 be the pre- 
image of Y\ p \ d H G ( P ) under the reduction map Z r_1 — > ]l p | d (Z/j9Z) r_1 ~ 

(Z/c?Z) r_1 . L{Q) is a lattice, whose discriminant (that is, the index in 
Z r ~ l ) is 

disc(^=n/° dim( ^ (p)) 

P \d 

The support supp(^) of L(Q) is the product of all primes p for which 
flew ^ (Z/pZy- 1 -- 

supp(£) Yl P 

Since codim(if G ( P )) < r — 1, we get 

supp(^) | disc(^) | supp^)^ 1 

We set 

A(^)=n A ^ (p) ) 

P \d 

where A(G) is given by ( f4.20| ). We also set for a divisor c | q 
a(h,c):=Y[a(h,p), A(h, c) := J[ A(h,p) 

p\c p\c 

Note that by Proposition |] 

(5.1) a(h,c) <c 1/2+e , A(/i,c)<c e 

for all e > 0. 

Our formula for R r (C, q) is 

Proposition 6. The r-level correlation function is given by 

(5.2) R r ( C ,q) = ^Y,- c E A @) E <h,c)A(h,c) 

c\q supp(e)|f hesCnL(Q) 
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Proof. We have that 



N 

h^scnzr- 1 



By the Chinese Remainder Theorem, 



N(h,q) = Y[N(h,p) 
pW 



We rewrite formula (14.31) in the form 



where 

A(h,p) = 2 r - r ^f {h) 

Thus we find 
(5.3) 

N(h ) - qA ( h > q "> V a{h,c) _ q ^ q a(h,c)A(h,c) 
\ 'V/ 2 TU) ^ J c 2 ru) ^ J c c 

c\q c\q 

Inserting ( |5.3| ) we get a formula for R r (C, q): Recalling that N = q/s, 
(5.4) R r (C, q) = -±-^J2- c J2 A ^ *M*> c ^ h > c ) 

c\q h£sC 



Next we use the expression ( 4.16 ) for A(h,p) to write A(h, 2 ) 
rip| 9 /c A(h,p) in the form 



(5.5) A(h, 1) = J2 m [p) )Kh, G^) = A (W> Q ) 



where the sum is over all tuples of set-partitions Q = ®r)\3.Q^ p \ one for 
each prime dividing 2 , and we put for each such tuple Q 

and 

^^^TT^^^I 1 modpforallplf 
*f otherwise 
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This is the characteristic function of the lattice L(Q) whose support 
supp(^) divides q/c. Thus we get the desired expression for R r (C,q) 

R r(C, q ) = ^J2~ c E A (£) E *(h,c)A(h,c) 

c\q supp(e)|f hesCnL(g) 

□ 
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6. Evaluating the t-level correlations 

In order to estimate the correlations using Proposition^, we partition 
the sum ( |5.2| ) into two parts: the first consisting of pairs c and Q such 
that cdisc(^) < s, and the second of the pairs for which cdisc(^) > s. 
We will show that the first part gives the main term and the second is 
negligible. 

6.1. The case cdisc(£) > s. We use a(h,c) < c l l 2+e |Tl]), and 
A(h,c) <C c e to see that this term is bounded by 

(6-i) E \KG)\#{sCnL{g)} c v^ 

c\q supp(S)|f 
cdisc(C?)>s 

By the Lipschitz principle (Lemma |16|), 

#KnL(5)}«^|| + s - 

and since vol(sC) = s r_1 vol(C), we find that 

(6.2) #{sCn L( e)} « 3 |l J + s - 

Moreover, in order that sC D L(Q) ^ 0, we will see that we need 
supp(^) <C s r ( r - 1 )/ 2 ; since C does not intersect the walls. This is a 
consequence of the following observation: Let C C R r-1 be a bounded 
convex set. Define 

r-l 



diam^C) = max{^ \xk\ : x E C} 



k=l 



Note that diamx scales linearly: diam^sC) = sdiam^C) for all s > 0. 

Lemma 7. //supp(^) > diam 1 (sC) T ^ r_1 ^ //2 then sCnL(Q) is contained 
in the walls {h e R r_1 : o~ij(h) = for some i < j}. 

Proof. Let dij(Q) be the product of the primes p such that vanishes 
on H G ( P ), i.e. so that 

o~ij{x) = mod p for all x G L{Q) 
Then dij{Q) \ supp(^) and moreover we claim that: 

disc(0)| Y[dij{g) 

i<j 
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It is enough to check this one prime at a time and is equivalent to 
saying that 

codim(#£( P )) < #{z < j : = on H^ p) } 

which follows since H G ( P ) is given by vanishing of some of the cr^ . 

Now note that if supp(^) > d T(j, ~ 1 ^ 2 then for some i < j, d^Q) > d 
because supp(^) < disc(C/) < Yli<j dij{G) and the last product consists 
of r(r — l)/2 factors. If we take d = diam^sC) = sdiami(C), then 
one has dij(Q) > diamx (sC) for some i < j. However <Jij(h) = 
mod dij(Q) and so &ij{h) = mdij(Q) for some integer m. If m = then 
h lies in a wall. If m ^ then being an integer, |m| > 1 and so 

Wij{h)\ > dij(Q) > diam^sC) 

Since 

j-i j-i r—l 

a ij( h ) = \y^ J h k \<y^ J \hk\ < ^2 i^i 

k=i k=i k=i 

we find that 

r-1 



2^ \ hk\ > diami(sC) 

k=i 

Thus h j£ sC by definition of diami (sC). □ 
By Lemma [?|, together with |A(C/)| <C supp(£) e , ( |6.1| ) is bounded by 

(«) ^E«-^ E -pp(e) e (a|Jy+^) 

clfl supp(5)|f V V ^ 7 

cdisc(C7)>s 
supp(e)<s r(r " 1)/2 

We split the sum into two parts and use s < 2"^ to bound ( |6.3|) by 
the sum of 

(6.4) -j2c- i/2+ - Yl sn ^ g y 



s ^— ' disc(^) 

c\q BUpp(0)|f V 7 

cdisc(C7)>s 



and 



(6.5) l^V 1 /^ £ supp^ 



s 

c|q supp(g)|f 
cdisc(CJ)>s 
supp(g)<s r ( r - 1 )/ 2 
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We begin by noting that the number of Q with supp(£?) = g is 0(g e ), 
i.e., 

(6.6) Yl 1<< ^> 

supp(5)=9 

Since we sum over supp(^) <C s^- 1 )/ 2 i n (|6.5|) , we have supp(£) e <C 
s e , and thus Q6.5Q is bounded by 

-E c ~ 1/2+e E /«^ 1+e E c " 1/2+e E L 

9<<s r(r-l)/2 g<s r(r-l)/2 

By Lemma 18, the number of divisors of q/c which are less than s r ( r_1 )/ 2 
is at most s e , so this term is bounded by 

s-1+e E c " 1/2+e - 

c\q 

Since 

c\q p\q ' p\q 

the contribution of ( |6.5| ) is at most 0(s~ l+e ). 

It now remains to bound ( |6.4| ) • We first consider the terms for which 
csupp(^) > s. Now, disc(^) > supp(^), so if csupp(^) > s then 
certainly cdisc(^) > s, and sum of the corresponding terms in ( |6.4| ) is 
bounded by 

s 



csupp(CJ)>s 

c|g 3|f supp(g)=g c\q g\& 

cg>s cg>s 

by (|6.6|) . Changing variable to d — eg, which is a divisor of q satisfying 
d > s, this is bounded by 

c -l/2+e i 

EE (d/c) 1 -^ = E ^i-e E 1 

d]g c\d w 7 d|<? c|d 

d>s a!>s 

Now the sum X]c|d c ^ 2+e * s bounded by r(d)d 1 ^ 2+e < rf 1 / 24 "^, so the 
above is bounded by 

d\q 
d>s 



c l/2+e 
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by Lemma [L^. This bounds the contribution of c, Q with c supp(£?) > s. 

If cdisc(£?) > s then dis ^ < c. This, together with (|6.6|) implies 
that 

E -pp( 6 )«^ 

c| 9 supp(ff)|f ^ 

c disc(5)>s 
csupp(C?)<s 

« I ^ g e « ^ 1/2+e E E 1 



< S 



-1/2+e 



2 



E 1 

\c<s / 

since ^c|? 1 *C s e by lemma [19]. Consequently ( |6.4j ) is 0(s~ 1/,2+e ). 
(Note that we only used supp (G) < s 7 '^- 1 )/ 2 to bound 



6.2. The case cdisc(^) < s. Fix c > 1 and Q and partition the 
lattice points in sC H L(Q) into two subsets as follows: Fix a reduced 
fundamental cell (see |B.1| ) P = P{Q) for the lattice L = L(Q). Then 
cP is a reduced fundamental cell for the dilated lattice cL. We can tile 
R r_1 by the translates h c + cP, h c G cL. 

Definition 6.1. We say that x G L fl sC is c-interior if there is some 
y E cL so that a; G y + cP C sC. We say that x G LflsC is a c-boundary 
point otherwise. 

Note that the notion depends on c and on the choice of a fundamental 
cell P for L. 

An important fact is that if dist (x, d(sC)) ^> r cdisc(L) then x is 
c-interior. This follows from Lemma [15] since diam(cP) <C r cdisc(L). 

Lemma 8. Let P be a fundamental cell for the lattice L C Z 7 '^ 1 , c > 1 
so t/iat gcd(c, disc(L)) = 1. Then for y G cL, the intersection L fl (y + 
cP) with L of the translate of the dilated cell y + cP consists of a full 
set of representatives of Z r_1 /cZ r_1 . 

Proof. If P = {Y7j=\ x j^j : < Xj < 1} then the c r_1 lattice points 

y + Y7j=\ n j^v n j = 0, 1, . . . , c — 1 in L fl y + cP are clearly inequiva- 
lent modulo cL, and are the only points of L in this intersection. We 
will show that if gcd(c, disc(L)) = 1 then they are inequivalent modulo 
cZ r_1 . To see this, it suffices to show that L fl cTI~ x = cL. By the 
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theorem on elementary divisors, there is a basis {e}} of Z r ~ x and inte- 
gers dj > 1 so that {djSj} is a basis of L, and disc(L) = n^=i 4r ^ 
x G L fl cZ r_1 then x = X]j=i m jdj^j G £ and also x = c n,,e} G 
cZ r_1 . Comparing coefficients we find 

(6.7) m i^i = m i> j = 1 , . . . , r — 1 

Now since dj | disc(L) and gcd(c, disc(L)) = 1, we have that gcd(c, dj) = 
1 and so ( |6.7| ) shows that rrij = mod c and x G cL. □ 

Lemma 9. a) The number of points y of cL so that y + cP C sC is 

vol(sC) +0 ^£^-2 



disc(cL) V c 
6j The number of c- boundary points of L is <C cs r_2 . 

Proof. If y = cz £ cL then ?/ + cP C sC if and only if z G L fl -C and 
z + P C 2C. Thus we need to count JV := #{2 GLHfC:z + PC 
^C}. An upper bound is obtained by a packing argument - since the 
translates z + P are disjoint and contained in -C, we get 

iVvol(P) < vol(-C) 

c 

and so 

vol(f C) s r ~ l vol(C) 



(6 ' 8) ^ " disc(L) c— 1 disc(L) 

For a lower bound, note that if z G L fl satisfies dist(z, <9(^C)) > 
diam(P) then z + P C -C. By the Lipschitz principle (Lemma p!^) and 
Lemma [17], the number A" of such points is 

^ = vol{^GfC:dist(x,g(fC))>diam(P)} + q , 

disc(L) 



Further, 



vol{a; G -C : dist(x, <9(-C)) > diam(P)} = vol(~C)+0 fdiam(P)(-^" 2 



and so 

~ vol( £ C) /diam(P)(^) r - 2 * A vol( £ C) /,s, , 
disc(L) \ disc(L) c / disc(L) V 



c 



because diam(P) <C r disc(L). 

Since N > N, together with the upper bound Q6.8|) we find 

-gg+°(0'- 2 
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b) For the number of c-boundary points, we subtract the number of 
c-interior points from the total number of points of L fl sC. The total 



number of points in L fl sC is by the Lipschitz principle (Lemma [16]) 



(6.9) Lna C=^£l + 0(8< 

disc(L) 



To count the number of c-interior points, we can write each uniquely 
as y + p, with y as in part a and p G L fl cP. Now #(L fl cP) = c r_1 
(see Lemma Bl) and so by part a, the number c-interior points is 



Subtracting (|6U0|) from (g]| gives part b. □ 



Fix Q, c > 1 with cdisc(^) < s. Note that since q is square-free and 
supp(^) | 2, we have gcd(c, disc(^)) = 1. We now estimate the sum 

J2 a(h,c)A(h,c) 
heL(g)nsC 

We divide this into two sums, E int over the c-interior points and 
over the c-boundary points. We use a(h, c)A(h, c) -C c l ^ 2+t to bound 
S bd by: 

#{c-boundary points}c 1/2+e < cs r '- 2 c 1/2+e = c 3/2+e s r ~ 2 

The contribution of the c-interior points is computed by writing each 
such h as h = y + h with h G cP fl L and y G cL fl sC. For each 
?/ we get all possible h , which run over a full set of representatives of 
Z r_1 /cZ r_1 since gcd(c, disc(^)) = 1 (Lemma §). Denote the number 

of such y by iV; by Lemma || part a, iV = ^(il) +0 ((f) r ~ 2 )- Moreover 



a(|/ + fr , c) A(y + fr , c) = a(/i 0) c) A(/i , c) 
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since y G cL(Q) C cZ 7 " -1 . Thus 



Emt = N ^ a(h , c) A(h , c) 

fco mod c 

= /vol(|C) , £ 

Idkii IV 

x fto mod c 

vol(*C) 

= dWZy E a(^c)A(^,,c)+0 

/io mod c 

= dRz( E a(/»o,c)A(ho, C )+0( 

/it; mod c 



^2 a(h ,c)A(h ,c) 



r -2 c r-i c i/ 2+e 



c 3/2 + , s r- 2l 



Thus the total contribution of the pairs with cdisc(^) < s is 



(6-11) 



E; E E «(ma(m 



2M<?) ^ c 

c|g supp(g)|f 
cdisc(CJ)<s 



/iGsCnL(g) 
vol(sC) 



2i-a%) 



Ej E E <^c)A(^c 



cdisc(C?)<s 



+ 



feo mod c 

/ \ 

^E; E |A(ff)k 3/2+ « S r - 2 

c\q supp(e)|f 
cdisc(5)<s 



To estimate the error in ( |6.11| ), note that the condition cdisc(^) < 
s implies csupp(^) < s since supp(^) < disc(^), so for an upper 
bound we may replace the summation over pairs satisfying the former 
condition by the sum over pairs satisfying the latter; this gives (noting 
that 2 W («) > s) 



E^ E ia(^)|c 3 / 2+ v- 2 « s -^^ c 1 /^ Yl \^)\ 



c\q supp(£0|f c\q supp(g)|f 

cdisc(Q)<s csupp(5)<s 



«s-^j2 cl/2+e H E 



c|<? g|f supp(0)=g 

cg<s 



Now |A(£)| <C supp(^) e and the number of Q with supp(^) = g is 
0(g e ), which is 0(s e ) since g < eg < s, so that the above is bounded 
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by 

c\Q g\i 
cg<s 

The number of small divisors g of - with g<s/c<sisat most s e , so 
the above is at most 

s- 1+e J2 cl/2+6 < s -1+e s 1/2+e #{c I q : c < s} < S - 1/2+e ' 

which gives that the error term in (|6.11|) is 0(s^ 1 ^ 2+e ). 

We now extend the sum of the first term in ( |6.11| ) to all the pairs c, Q, 



introducing an error which was bounded in section |6.1| by 0(s 1 / 2+e ). 
(This is the term (|6.4|) which was bounded without using the condition 
supp(£) < s r ^' 2 .) 

In summary we find that 

Proposition 10. 

(6.12) 



2"»(q) L-j c ^ c r - 1 disc(L) 

c\q supp(C?)|2 ho mode 



+ 0(s 



-l/2+e\ 
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7. The main term 
We now treat the main term of (|6.12|) . Define 

M = ^w>E- c E w^^n E <^)A M 

c\q supp(5)|2 h mode 

we will show 

M = vol(C) 

which with ( 6.12 ) will prove Theorem |l[ 
The sum over h mod c is multiplicative: 

J2 a(h,c)A(h,c)=Y[ J2 <h,p)A(h,p) 

h mod c p\c h mod p 

Furthermore, by Lemma [5] 

h mod p C?( p ) 

Now note that since p codimH GW — disc(G^) we get 



- codim H 



g(p) 



2 ~fe) ^ disc(5)^ ^ ^discfG^) 



Further, 



V A(g) _ nv A(g«) 



supp(£?)|f Pi? G< p > v — 7 



Therefore we find that 



■M = voi(c)^^vnv A( - P ^ TTia + V-r^TT 

= vol(C)^— V A(^)B(c) 
V_i(g r V c 



THE DISTRIBUTION OF SPACINGS BETWEEN QUADRATIC RESIDUES 27 

Thus M. is a multiple of the Dirichlet convolution of the multiplicative 
functions A, B, with A(l) = B(l) = 1, 



^ dtac(G«) 

and (since (1 + l/p) r = a-\{p) r ) 

(7.1) B(p) = a-iipY - A(p) 

Now we have 

(A*B)(q) := J>(^)5(c) = U(A(l)S(p)+A(p)B(l)) = U^pf 

c\q p\q p\q 

by ( pM|) . This finally gives the main term of R r (C, q): 

M = vo\(C)—^--(A * B)(q) = vol(C)— ^^(gr = vol(C) 
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Appendix A. Recovering the level spacing from the 

CORRELATIONS 

In this appendix, we explain how to recover the various spacing dis- 
tributions from the correlation functions. This is well-known in the 
physics literature (e.g. and is certainly implicit in Hooley's work 
H |10[ , but we do not know of a good source for it in the mathemat- 
ical literature. A very detailed treatment of this and more will appear 
in a forthcoming book by Katz and Sarnak jjEJ . 

We begin with R/Z which we think of as the circle with unit circum- 
ference. We denote by {x} the fractional part of a;: If n < x < n + 1, 
n integer, then {x} = x — n. We set 




{x} < {x} < \ 
{x} - 1 § < {x} < 1 



We will order the points in R/Z counter-clockwise and write x >- y 
if the points lie in a segment of length < 1/2 on R/Z and x follows 
y. The signed distance on R/Z is given by ((x — y)); thus —1/2 < 
((x — y)) < 1/2. In terms of the signed distance, x y y if and only if 
((•'• //))•('. 

Given a finite set S of N points on R/Z, and k > 2, the k-level 
correlation functions measure clustering properties of the sequence S C 
R/Z on a scale of the mean spacing 1/N: For a fc-tuple of points 
x = (xi, . . . , Xj.) of 5*, the oriented distance vector is 

(A.l) D(x) = ((On - x 2 )\ • • • , ((x fe -i - x k ))) 

Given a bounded set C C R fc_1 , we define the k-level correlation as 

R k (C,S) = ^{xeS k -.D(x)e^C} 

As an example, let A fc_1 C R fc_1 be the standard open simplex 

fc-i 

A^ 1 = {(j/i,...2/ fc -i) \y i >0,J2vi< 1 } 

i=i 

and for t > set C = tA^ 1 . Then if > 2t, D(x) E ±C = ^A^ 1 
means that 

1. ((xj — Xj + i)) > 0, that is xi >- £2 >- • • • >- 

2. The points all lie in an arc of length at most t/N. 

As another example, write k — 1 — i + j and for t 1 ,t 2 > set 
C = £iA l x £2^"'; which we can write as 

C = {(yi, ■ ■ ■ yk) ■ Vm > 0, yi + y 2 H h yi < h, y i+1 H h < t 2 } 
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Then D{x) G j^C iff Xi >- x 2 >- • • • >- x k and x±, . . . ,x i+ i lie in an 
arc of length < t±/N, and x i+ ±, . . . , x i+ j + \ = x k lie in an arc of length 
<t 2 /N. 

Given any subset T C S which is contained in a semi-circle, the 
ordering gives us unique initial and final elements of T, and we can 
write T = {x init = x\ -< x 2 -< ■ ■ ■ -< Xf in }. We denote by |T| the 
number of elements of T, and by diam(T) the distance dist(xj n j t , Xfi n ) 
between the initial and final points of T. If T consists of just the initial 
and final points, we say that T is a consecutive pair. A consecutive 
k-tuple of S is a fc-tuple of elements X\ = x init -< ■ ■ • -< x k = Xf in so 
that there are no points of S between Xj and Xj+i, for 1 < j < k. 

For x < 1/2, let N k (x) be the number of /c-tuples of diameter smaller 
than x; this is zero if k ^> 1. It is clear from the definitions and the 
discussion above that we can describe these functions in terms of the 
correlation function of the simplex xA k ~ 1 by 

(A.2) Rk(xA k -\S) = ^N k (^) 

Further, let g(x) be the number of consecutive pairs of diameter less 
than x, that is the number of spacings between consecutive elements of 
S of length less than x. We may express g in terms of an alternating 
sum of iVVs as follows: 

Lemma 11. With g and N k as above, we have for x < 1/2 

g(x) = J2(-±) k N k (x)- 

k>2 

Moreover, for all n > 1, we have the inequalities 

2n+l 2n 

J2(-i) k Mx)<g(x)<J2(-V kN ^ 

k=2 k=2 

Before giving the proof, we will need the following elementary lemma 
on sums of binomial coefficients. 

Lemma 12. Let m > be an integer. Then ^™ (— I) 1 (7) = unless 
m = 0, in which case the sum equals 1. Moreover, 

2n+l / \ m / \ 2n / 

Proof. The first part is just the binomial expansion of (1 — l) m . As for 
the second part, if m > 1 use the identity (™) = ( m ~ 1 ) + to find 

Eto(- 1 )'(T) = (- 1 ) fc (T x ) from which the claim follows. □ 
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We can now prove lemma [H} 

Proof. For each pair T = {a >- b}, of diameter less than 1/2, we 
associate X T , the set of all z-tuples x\ y . . . y Xj in S such that 
(x\,Xi) = (a,b). The set of all tuples of diameter less than x is 
thus expressed as a disjoint union of the X T 's as T ranges over all 
pairs of diameter less than x. If we let Nf be the number of i- 

tuples in X T then A^ = Y, T N i- But = ( |T t~ 2 )' so b y lemma 



^ i>2 (— l) l A^ T is zero unless T is a consecutive pair, in which case the 
alternating sum is one. Summing over all consecutive pairs we get that 
g(x) = X^fc>2( — • Lemma [T2| also gives that for n > 0, 

2n+l 2n 

£(-iW ^ B-W ^ B-W 

i=2 i>2 1=2 

Summing over all T we get the second assertion. □ 

A.l. The joint level spacing. An (i, j)-tuple of diameter (x, y) is an 
{i + j)-tuple X\ y . . . y Xi y x i+ i y . . . y x i+ j (all lying in an arc of 
length < 1/2) such that dist(xj,Xi) = x and dist(x i+ j, Xj) = y. 

For i > 2, j > 1 and x + y < 1/2 we let N it j(x,y) be the number 
of (i,j)-tuples of diameter at most (x,y). Let g(x,y) be the number 
of consecutive triples X\ y X2 y X3 of diameter smaller than (x, y) . 
Analogously to lemma |TT] we have: 

Lemma 13. If we let A k (x,y) = ^2 i+ j =k Ni t j(x,y), then 
s(x,y)=£(-l)* + %(x,y). 

k>3 

Moreover, for n > we have the inequalities 

3+2n+l 3+2n 

£ (-l) k+1 A k (x,y)<g(x,y) < £ (-l) k+1 A k (x, y). 

k=3 k=3 

Proof. For each triple T = {a y b y c} of diameter at most (x,y), 
let X T be the set of (i, j) -tuples xi y . . . y Xj y x i+ i y . . . y x i+ j 
such that (xi, Xj, Xi+j) = (a,b,c), and let Njj be the number of 
tuples in Xt- We may write the set of (i, j)-tuples of diameter smaller 
than (x,y) as a disjoint union of Xy's, as T ranges over all (2,1)- 
tuples with diameter at most (x,y). Given T, we may count tuples 
of type in Xt as follows: Let M,N be the number of elements 
of S between a, b and b, c respectively (we allow both M and A" to be 
zero.) Then iVj = (£)(£). Moreover, ^ = £^= fc JVS = C^) 
since there are ) ways of choosing — 3 objects out of M "blue" 
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and iV "red" objects. By lemma [12|, we see that J2 k>3 (— l) fc+1 AjT = 
*l2k>3(~^) k+1 d-3) * s zero umess T is a consecutive (2, l)-tuple, in 
which case it is one. Now, lemma 12 together with A\ = shows 
that 

3+2n-l 3+2n 

E (-!) fe+1 ^ < £(-i) fc+1 ^ ^ E(- 1 ) fe+1 ^- 

fe=3 fe>3 fc=3 

Summing over all triples T of diameter at most (x, y) we are done. □ 
A. 2. Applications to squares mod q. We let 

Tl 

S a = {— : < n < q — 1, n a square modulo g} C R/Z 

be the image in R/Z of the set of squares in Z/gZ. The mean spacing 
between elements of S q is l/N q , where is the number of squares 
modulo q. For x > 0, g q {-&-) is the number of consecutive pairs in S q of 
diameter at most x/N q , that is the number of normalized consecutive 
spacings of length < x, and we set 

P{x) = lim w g q {^r) 

This is the limiting proportion of normalized consecutive spacings in 
S q of length at most x (this normalization sets the mean spacing to be 
unity). P(x) is the cumulant of the level spacing distribution P(s) of 
the introduction. Likewise we set for x, y > 0, 

P(x,y) = lim — <7o(— -, — ) 

V '^^ q^oo Nq^ N q ' N q 

the cumulant of the joint level spacing distribution. 

For a bounded convex set C C R fc_1 , not intersecting the walls, 
and AT ^> 1, jj-C will be contained in the cube (— 1/2, l/2) fc_1 . For 

x = - G S*, (0 < rii < q are squares modulo q) the oriented distance 

vector D(x) (see ( A.l )) will lie in 4-C if and only if there is an integer 

vector h G -^C n Z fc_1 so that 

Xi — Xi+i = hi mod q, 1 < i < k — 1 

Denoting by AT(/i, g) the number of solutions of the above system 
in squares modulo q, we have found that the correlation function 
Rk(C,q) := Rk(C,S q ) satisfies 

(A.3) i4(C, g ) = J_ N(h,q) 

q /igscnz*- 1 
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with s = q/Ng. 

Lemma 14. If x,y > then 

P(x) = l-e~ x 

and 

P(x,y) = (l-e-*)(l-e- y ) 



Proof. As noted above (see (|A.2f) ), we can express the functions N k (x) 
in terms of the c 
whose volume is 



in terms of the correlation functions associated to the simplex xA k 

fc-i 



(fc-l)!- 

R k (xA k - 1 ;q) = ±-N k (^-) 

From theorem [I] we know that 

R k (xA k -\ q) = x*- 1 voKA^ 1 ) + O k (s-V 2+e ) 

Jfe-1 



(k-l)\ 

By lemma |TT] we see that for n > 0, 



l+2n+l i n (_x_ 

Y (-l)^<liminf^ 



and 



^ i a \ l+2n 

y?v nJ . / . w . 1 a; 



lim sup ^ 



+ 1' 



Letting n — ► oo and noting that the above polynomials are truncations 
of the Taylor series of 1 — e~ x we are done. 

For the second part of the lemma, recall that Nij(x, y) is the number 
of ordered z+j-tuples of elements of S q such that the first i are contained 
in an interval of length x, and the last j elements lie in an interval of 
length y. Thus, analogously to ( |A.2| ), N i; j(x,y) is a scaled version of 
the (i+j — l)-correlation with respect to the convex set xA l ~ l x yA^: 



-R l+j {x^xyA^q). 



By Theorem |], 

m 

since 



Ri+jixA*- 1 x y£j- q) = f V + O^^) 



VD^sA*" 1 x yA J ) = - ' y 
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Letting A k (x,y) = J2 i+j=k N id (x,y) and using lemma |12|, we get 

-, 3+2n ;_i 

um.up^ »)<E(-d-E x v 



and 



i>l 
j>0 

3+2n j 

V / y , - <lirninf — gJ— ,^-) 

i>o 

Since the above polynomials are truncations of the Taylor series for 
(1 — e~ x )(l — e~ y ), we are done. □ 
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Appendix B. Some Geometry of Numbers 

B.l. Given a basis l\ . . . , 4 of a lattice L in R", the fundamental cell 
is the half-open set 

P({£i}) := {xJi + --- + x n e n :0<Xi<l} 

It serves as a fundamental domain for the action of L on R" by transla- 
tions. The volume of P({£i}) is the discriminant disc(L) of the lattice 
L: 

vol(P({5») = | det(4 . . . ,4)1 = disc (£) 

B.2. We need the following basic fact (due to Mahler and Weyl) from 
reduction theory: In any dimension n > 1, there are constants < c' n < 
d' n so that any lattice L C R n has a basis l\ . . . , 4 which is reduced in 
the sense that 

(R1) c - - ^cTZT - n 

This is a consequence of Minkowski's second theorem on successive 
minima; see (Cassels Lemma V.8, p. 135) or (Siegel [pOfl , X §6). 
This basis is not unique in general. 

B.3. We define the diameter diam(L) of the lattice L to be the mini- 
mum of the diameters of all fundamental cells for L. 

Lemma 15. The diameter of an integer lattice L C Z n is bounded by 
the discriminant of L: 

(B.2) diam(L) <„ disc(L) 

the implied constant depending only on the dimension n. 

Proof. It suffices to show that if P({£i}) is the fundamental cell of an 
integer lattice L C Z n with respect to a reduced basis {£{}, then the 
diameter of P({£i}) is bounded by the discriminant of L: 

(B.3) diam(P({f,})) < n disc(L) 

To see this, note that since L C Z™ is an integer lattice, the length 
of any non-zero vector in L is at least 1, and then this implies that a 
reduced basis has bounded eccentricity: 

(B.4) 1 < |/i| < |4| < • • • < 141 < C disc(L) 

(assuming we ordered the basis vectors according to their length). In- 
deed, using (|B.1| ) together with \£i\ > 1 we get an upper bound for the 
longest basis vector £ n 

141 = i • 141 < |4| • 141 141 < <disc(L) 
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Thus the diameter of the fundamental cell P({£j}) is at most 

n 

<n\£ n \ <<disc(L) 

i=i 

as required. □ 

B.4. It will be useful to note that for integer dilates cL of a lattice L, 
c > 1, the diameter scales linearly: diam(cL) = cdiam(L), while the 
discriminant scales with c n : disc(cL) = c n disc(L). Thus to bound the 
diameter of a dilate of an integer lattice we use 

(B.5) diam(cL) <^C„ cdisc(L) 

B.5. The Lipschitz principle. 

Definition B.l. A set C C R" is of class m if the intersection of every 
line with C consists of at most m intervals (including the degenerate 
case when some of the intervals are points), and if the same is true for 
the projection of C on every linear subspace. 

Thus for instance a convex set is of class 1. 

We will use the following form of the "Lipschitz principle" from the 
geometry of numbers to estimate the number of lattice points in a 
region of R" : 

Lemma 16. Let L C Z™ be an integer lattice of discriminant disc(L), 
and C C R n a set of class m (e.g. a convex set). Suppose that C lies 
in a ball of radius R around the origin. Then 

(B.6) * {LnC) = ^T) + ° {Rn ~ 1) 

This follows from the Lipschitz principle for the integer lattice proven 



by Davenport ||, as adapted by W. Schmidt (]nj, Lemma 1). 

We will apply the Lipschitz principle to certain subsets of convex 
sets. For this purpose we will need: 

Lemma 17. Let C C R n be a convex set, d > and define 

C d := {xeC: dist(a;,<9C) > d} 

to be the set of points of C of distance at least d from the boundary dC 
of C. Then Cd is convex. 

Proof. What we need to show is that for any x%, x<i e Cd, and A G [0, 1], 
the point x 3 = X\ + A(x 2 — Xi) also lies in Cd, that is if \y\ < d then 
x 3 + y EC. But x 3 + y = (x 1 + y) + A((x 2 + y) - {x\ + y)), i.e. x 3 + y 
lies on a line between x\ + y and x<i + y. These two points lie in C since 
x\, x 2 G Cd- By convexity so does x 3 + y. □ 
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Appendix C. Counting small divisors 

In the paper, we need to use some estimates for the number of di- 
visors of q that are smaller than a fixed power of the mean spacing 
s. As is well known, the number of all divisors of q is 0(q e ) for all 
e > 0. This is not enough for our purposes, as we need a bound which 
is 0(s e ). This is provided by the following lemmas: 

Lemma 18. Let q be square-free, s = 2^ / 'a -i(q) ■ Fix a > 0. Then 
as s — > oo 

#{d\q:d<s a } = 0{s e ). 

for all e > 0. 

Proof. We start by bounding products of k distinct primes below by 
k k ; we may assume that the primes are the first k primes. Then by the 
Prime Number Theorem, 

k k 

log Y[pi = ^2 logpi ~ pk ~ k log k 

i=l i=l 

Exponentiating we see that the product is bounded below by k k . Now, 

#{d | q : d < s a } = ^ cij, 

j 

where aj = a(j,s a ,q) is the number of divisors of q that are smaller 
than s a and have precisely j prime factors. But if j > N, where N 
is the smallest integer such that N N > s a , then aj = 0. Moreover, 
setting w = u(q), we see that aj < ( w ). Hence 



d\q j<N 
d<s a 



By Stirling's formula, (^) <C mj^w- Thus 



(N/eY 

, , w \ ,r (we\ N ( N log(N)e 



N 



since N N > s a > 2 u ' a ( 1 ^) implies that w < ■ Thus 

{rf | g : d < S "} « N (^^Y « (ClogA)^ 
But the last term is clearly 0(s e ). □ 
Lemma 19. If a > toen < s- a+e 

d>s 
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Proof. We divide the sum into two parts: One over s < d < s R and 
the other over d > s R (R is a parameter chosen later). For the first, 
we use the fact that there are few (namely 0(s e )) divisors d of q with 
d < s R to bound that contribution by 

d\q d\q 

s<d<s R s<d<s R 

For the summands with d > s R , use d~ a < s~ Ra and r(q) = 2^ <C 
s 1+e to get 

d' a < s~ Ra r(q) < s 1 -^ 
Now choose R > so that 1 — i?a < —a to conclude the lemma. □ 
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